منابع مشابه
Learning Grimaces by Watching TV
Differently from computer vision systems which require explicit supervision, humans can learn facial expressions by simply observing other humans in their environment. In this paper, we consider the problem of developing similar capabilities in machine vision. As a starting point, we look at the problem of relating facial expressions to objectively measurable events occurring in videos and make...
متن کاملLearning to Estimate Pose by Watching Videos
In this paper we propose a technique for obtaining coarse pose estimation of humans in an image that does not require any manual supervision. While a general unsupervised technique would fail to estimate human pose, we suggest that sufficient information about coarse pose can be obtained by observing human motion in multiple frames. Specifically, we consider obtaining surrogate supervision thro...
متن کاملLearning Image Matching by Simply Watching Video
This work presents an unsupervised learning based approach to the ubiquitous computer vision problem of image matching. We start from the insight that the problem of frame-interpolation implicitly solves for inter-frame correspondences. This permits the application of analysisby-synthesis: we firstly train and apply a Convolutional Neural Network for frame-interpolation, then obtain corresponde...
متن کاملLearning by watching: extracting reusable task knowledge from visual observation of human performance
|A novel task instruction method for future intelligent robots is presented. In our method, a robot learns reusable task plans by watching a human perform assembly tasks. Functional units and working algorithms for visual recognition and analysis of human action sequences are presented. The overall system is model based and integrated at the symbolic level. Temporal segmentation of a continuous...
متن کاملTIVA Learning to Recognize Speech by Watching Television
Imagine a computer plugged into your television at home or in a foreign hotel. In the morning, it can barely transcribe one out of two words correctly, but by evening, it can provide a largely correct transcript of the evening news show, based on what it learned during the day from the TV. This seems like the core of familiar science fiction shows, but the research described here brings this vi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEJ Transactions on Electronics, Information and Systems
سال: 1997
ISSN: 0385-4221,1348-8155
DOI: 10.1541/ieejeiss1987.117.9_1331